Corpus: war_wikipedia_2018_1M

2.2.5 Most frequent word beginnings

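The most frequent word beginnings, given as character n-grams of length N = 1 to 5, together with a Zipf diagram of their rank-frequency distribution. Counts are case-sensitive, so "Pa-" and "pa-" are listed separately.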
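The counting behind such lists can be sketched in a few lines of Python. This is only an illustration, not the portal's actual implementation: the function name `top_word_beginnings` and the toy token list are hypothetical. The page does not state whether frequencies are counted over running words or over distinct word forms, or whether words of exactly N characters are included; the sketch counts every token of length at least N.

```python
from collections import Counter


def top_word_beginnings(tokens, n, k=5):
    """Return the k most frequent word-initial character n-grams with counts.

    Assumption: every token with at least n characters contributes its first
    n characters; counting is case-sensitive, as in the tables on this page.
    """
    counts = Counter(tok[:n] for tok in tokens if len(tok) >= n)
    # Sort by descending frequency, then alphabetically for a stable order.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:k]


# Toy input, invented for illustration only (not taken from the corpus).
tokens = ["Para", "para", "pseudo", "Pseudo", "pseudo", "sub", "ma"]

for n in range(1, 6):
    print(f"N = {n}")
    for rank, (prefix, freq) in enumerate(top_word_beginnings(tokens, n), start=1):
        # Same column order as the tables: rank, frequency, n-gram.
        print(rank, freq, prefix + "-")
```

Passing a list of distinct word forms instead of running text would give type-based rather than token-based counts with the same function.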


[Figure: Zipf's diagram for word beginnings (Gnuplot diagram)]
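A Zipf diagram plots frequency against rank on doubly logarithmic axes; a distribution that follows Zipf's law appears as an approximately straight line with a slope near -1. The sketch below shows how the plotted points and a least-squares slope could be computed in Python. It is an assumption-laden illustration, not the Gnuplot script used for this page: the helper names `zipf_points` and `fit_slope` are hypothetical, and the sample frequencies are an idealized 1/rank series, not corpus data.

```python
import math


def zipf_points(frequencies):
    """Return (log10 rank, log10 frequency) pairs, most frequent item first."""
    ordered = sorted(frequencies, reverse=True)
    return [
        (math.log10(rank), math.log10(freq))
        for rank, freq in enumerate(ordered, start=1)
        if freq > 0  # log10 is undefined for zero counts
    ]


def fit_slope(points):
    """Ordinary least-squares slope of y on x; needs at least two points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx


# Idealized example: frequency proportional to 1 / rank.
ideal = [1000 / r for r in range(1, 51)]
print(round(fit_slope(zipf_points(ideal)), 3))
```

The same functions can be applied to a full frequency list of word beginnings for a given N; the five rows shown per table are too few for a meaningful fit.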

Top Characters
Rank  Frequency  N-gram
1     26792      p-
2     26044      s-
3     25886      c-
4     25007      a-
5     20214      m-
Top Character Bigrams
Rank  Frequency  N-gram
1     7504       ma-
2     6757       ca-
3     6745       pa-
4     5698       co-
5     4952       Pa-
Top Character Trigrams
Rank  Frequency  N-gram
1     3109       Par-
2     2990       sub-
3     2646       par-
4     2492       pse-
5     2035       tri-
Top Character 4-Grams
Rank  Frequency  N-gram
1     2458       Para-
2     2454       pseu-
3     1877       Pseu-
4     1523       para-
5     1265       nigr-
Top Character 5-Grams
Rank  Frequency  N-gram
1     2451       pseud-
2     1873       Pseud-
3     1018       longi-
4     859        brevi-
5     775        micro-
Computation time: 2624 ms (2024-05-22 01:15)